SEQUENCE LISTING 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Akzo nobel n.v. 

(B) STREET: Velperweg 7 6 

(C) CITY: Arnhem 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP) : 682 4 BM 

(G) TELEPHONE: 0412-66637 9 

(H) TELEFAX: 0412-650592 

(I) TELEX: 37503 akpha nl 

(ii) TITLE OF INVENTION: Novel estrogen receptor 
(ill) NUMBER OF SEQUENCES: 28 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



15 



- 30 - 

ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60 

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120 

5 

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 18 0 

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 24 0 

10 GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300 

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360 

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420 

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480 

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540 

20 AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600 

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660 

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720 

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780 

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840 

3 0 TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900 

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960 

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA \020 

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 108 0 

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 114 0 



25 



35 
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ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 12 00 

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGTAA CAAGGGCATG 12 60 

GAACATCTGC TCAACATGAA GTGCAAAAAT GTGGTCCCAG TGTATGACCT GCTGCTGGAG 132 0 

ATGCTGAATG CCCACGTGCT TCGCGGGTGC AAGTCCTCCA TCACGGGGTC CGAGTGCAGC 1380 

CCGGCAGAGG ACAGTAAAAG CAAAGAGGGC TCCCAGAACC CACAGTCTCA GTGA 14 34 
(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60 

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120 

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 18 0 

30 

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 24 0 

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300 

35 GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360 

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 42 0 

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480 
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ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 54 0 

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600 
CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660 
GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720 
ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 7 80 

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840 
TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900 
CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960 

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020 

20 CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080 

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140 

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200 

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTG A 1251 
(2) INFORMATION FOR SEQ ID NO: 3: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear # 



35 



(ii) MOLECULE TYPE: peptide 



- 33 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp 
15 10 15 

5 

Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser lie Gin Gly His 
20 25 30 

Asn Asp Tyr lie Cys Pro Ala Thr Asn Gin Cys Thr lie Asp Lys Asn 
10 35 40 45 

Arg Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val 
50 55 60 

15 Gly Met 

65 

(2) INFORMATION FOR SEQ ID NO: 4: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 eunino acids 

(B) TYPE: 2unino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inea r 



3^ 



(ii) MOLECULE TYPE: peptide 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Leu Val Leu Thr Leu Leu Glu Ala Glu Pro Pro His Val Leu lie Ser 

15 10 15 

35 Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr 

20 25 30 

Lys Leu Ala Asp Lys Glu Leu Val His Met He Ser Trp Ala Lys Lys 

35 40 45 
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lie Pro Gly Phe Val Glu Leu Ser Leu Phe Asp Gin Val Arg Leu Leu 
50 55 60 

Glu Ser Cys Trp Met Glu Val Leu Met Met Gly Leu Met Trp Arg Ser 
65 70 75 80 

lie Asp His Pro Gly Lys Leu lie Phe Ala Pro Asp Leu Val Leu Asp 
85 90 95 

Arg Asp Glu Gly Lys Cys Val Glu Gly lie Leu Glu lie Phe Asp Met 
^ 100 105 110 

Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu Leu Lys Leu Gin His Lys 
15 115 120 125 

Glu Tyr Leu Cys Val Lys Ala Met lie Leu Leu Asn Ser Ser Met Tyr 
130 135 140 

20 Pro Leu Val Thr Ala Thr Gin Asp Ala Asp Ser Ser Arg Lys Leu /Via 

145 150 155 160 

His Leu Leu Asn Ala Val Thr Asp Ala Leu Val Trp Val lie Ala Lys 
165 170 175 

25 

Ser Gly lie Ser Ser Gin Gin Gin Ser Met Arg Leu Ala Asn Leu Leu 
180 185 190 

Met Leu Leu Ser His Val Arg His Ala Ser Asn Lys Gly Met Glu His 

30 195 200 205 

Leu Leu Asn Met Lys Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu 

210 215 220 

35 Leu Glu Met Leu Asn Ala His Val Leu 

225 230 



(2) INFORMATION FOR SEQ ID NO: 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Asn Tyr Ser lie Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro 
15 10 15 

15 

Gly Arg Gin Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His 
20 25 30 

Leu Ser Pro Leu Val Val His Arg Gin Leu Ser His Leu Tyr Ala Glu 
20 35 40 45 

Pro Gin Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu 
50 55 60 

.95 Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys 

65 70 75 80 

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys 
85 90 95 
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Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser 
100 105 110 

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser lie Gin Gly His Asn 

115 120 125 



Asp Tyr lie Cys Pro Ala Thr Asn Gin Cys Thr lie Asp Lys Asn Arg 
130 135 140 



Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly 
145 150 155 160 



Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val 
165 170 175 

Arg Arg Gin Arg Ser Ala Asp Glu Gin Leu His Cys Ala Gly Lys Ala 
180 185 190 

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp 
195 200 205 

Ala Leu Ser Pro Glu Gin Leu Val Leu Thr Leu Leu Glu Ala Glu Pro 
210 215 220 

Pro His Val Leu lie Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser 
225 230 235 240 

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met 
245 250 255 

lie Ser Trp Ala Lys Lys lie Pro Gly Phe Val Glu Leu Ser Leu Phe 
260 265 270 

Asp Gin Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met 
275 280 285 

Gly Leu Met Trp Arg Ser lie Asp His Pro Gly Lys Leu lie Phe Ala 

290 295 300 

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly lie 
305 310 315 320 

Leu Glu lie Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu 
325 330 335 



Leu Lys Leu Gin His Lys Glu Tyr Leu Cys Val Lys Ala Met lie Leu 
340 345 350 
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Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gin Asp Ala Asp 
355 360 365 

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu 
5 370 375 380 

Val Trp Val lie Ala Lys Ser Gly lie Ser Ser Gin Gin Gin Ser Met 
385 390 395 400 

10 Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Ser 

405 410 415 

Asn Lys Gly Met Glu His Leu Leu Asn Met Lys Cys Lys Asn Val Val 
420 425 430 

15 

Pro Val Tyr Asp Leu Leu Leu Glu Met Leu Asn Ala His Val Leu Arg 
435 440 445 

Gly Cys Lys Ser Ser lie Thr Gly Ser Glu Cys Ser Pro Ala Glu Asp 
20 450 455 460 

Ser Lys Ser Lys Glu Gly Ser Gin Asn Pro Gin Ser Gin 
465 470 475 

25 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: cotiino acid 

30 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Asn Tyr Ser lie Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro 
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15 10 15 

Gly Arg Gin Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His 
20 25 30 

5 

Leu Ser Pro Leu Val Val His Arg Gin Leu Ser His Leu Tyr Ala Glu 
35 40 45 

Pro Gin Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu 
10 50 55 60 

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys 
65 70 75 80 

15 Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys 

85 90 95 

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser 
ICQ 105 110 

20 

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser lie Gin Gly His Asn 
115 120 125 

Asp Tyr lie Cys Pro Ala Thr Asn Gin Cys Thr lie Asp Lys Asn Arg 

^25 130 135 140 

Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly 
145 150 155 160 

30 Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val 

165 170 175 

Arg Arg Gin Arg Ser Ala Asp Glu Gin Leu His Cys Ala Gly Lys Ala 

180 185 190 
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Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp 
195 200 205 



Ala Leu Ser Pro Glu Gin Leu Val Leu Thr Leu Leu Glu Ala Glu Pro 
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210 215 220 

Pro His Val Leu lie Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser 
225 230 235 240 

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met 
245 250 255 

lie Ser Trp Ala Lys Lys lie Pro Gly Phe Val Glu Leu Ser Leu Phe 
260 265 270 

Asp Gin Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met 
275 280 285 

Gly Leu Met Trp Arg Ser lie Asp His Pro Gly Lys Leu lie Phe Ala 
290 295 300 

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly lie 
305 310 315 320 

Leu Glu lie Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu 

325 330 335 

Leu Lys Leu Gin His Lys Glu Tyr Leu Cys Val Lys Ala Met lie Leu 
340 345 350 

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gin Asp Ala Asp 

355 360 365 

30 Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu 

370 375 380 

Val Trp Val lie Ala Lys Ser Gly lie Ser Ser Gin Gin Gin Ser Met 
385 390 395 400 ' 
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Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Arg 
405 410 415 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



15 GGIGAYGARG CWTCIGGITG YCAYTAYGG 2 9 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AAGCCTGGSA YICKYTTIGC CCAIYTIAT 29 



(2) INFORMATION FOR SEQ ID NO: 9: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
TGTTACGAAG TGGGAATGGT GA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TTGACACCAG ACCAACTGGT AATG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
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GGTGGCGACG ACTCCTGGAG CCCG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12; 

GTACACTGAT TTGTAGCTGG AC 
(2) INFORMATION FOR SEQ ID NO: 13: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCATGATGAT GTCCCTGACC 
35 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

10 TCGCATGCCT GACGTGGGAC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GGCSTCCAGC ATCTCCAGSA RCAG 



(2) INFORMATION FOR SEQ ID NO: 16: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: cDNA 



- 4 4 - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGAAGCTGGC TCACTTGCTG 2 0 

5 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TCTTGTTCTG GACAGGGATG 20 
(2) INFORMATION FOR SEQ ID NO: 18: 



20 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
^25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
35 GCATGGAACA TCTGCTCAAC 2 0 

(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



AGCAAGTTCA GCCTGTTAAG T 



21 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 



60 



30 



ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 12 0 
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CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180 
GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC ^2 4 0 

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300 



GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360 



10 



- 4 6 - 

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 42 0 

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 4 80 

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 54 0 

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600 

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660 

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720 

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780 

15 AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840 

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900 

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960 

20 

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020 

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080 

?5 GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140 

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200 
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CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTC TGCCTGA 1257 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

5 

Met Asn Tyr Ser lie Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro 
15 10 15 

Gly Arg Gin Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His 
10 20 25 30 

Leu Ser Pro Leu Val Val His Arg Gin Leu Ser His Leu Tyr Ala Glu 
^ 35 40 45 

15 Pro Gin Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu 

50 55 60 

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys 
65 70 75 80 

20 

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys 
85 90 95 

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser 

25 100 105 110 

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser lie Gin Gly His Asn 

115 120 125 

30 Asp Tyr lie Cys Pro Ala Thr Asn Gin Cys Thr lie Asp Lys Asn Arg 

130 135 140 

Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly 
145 150 155 160 



35 



Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val 
165 170 175 



Arg Arg Gin Arg Ser Ala Asp Glu Gin Leu His Cys Ala Gly Lys Ala 
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180 185 190 

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp 
195 200 205 

5 

Ala Leu Ser Pro Glu Gin Leu Val Leu Thr Leu Leu Glu Ala Glu Pro 
210 215 220 

Pro His Val Leu He Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser 
10 225 230 235 240 

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met 
245 250 255 



15 



20 



He Ser Trp Ala Lys Lys He Pro Gly Phe Val Glu Leu Ser Leu Phe 
260 265 270 

Asp Gin Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met 
275 280 285 

Gly Leu Met Trp Arg Ser He Asp His Pro Gly Lys Leu He Phe Ala 

290 295 300 

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly He 

^25 305 310 315 320 

Leu Glu He Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu 

325 330 335 

30 Leu Lys Leu Gin His Lys Glu Tyr Leu Cys Val Lys Ala Met He Leu 

340 345 350 

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gin Asp Ala Asp 

355 360 365 
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Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu 
370 375 380 



Val Trp Val He Ala Lys Ser Gly He Ser Ser Gin Gin Gin Ser Met 



385 



390 



395 



400 



Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Arg 
405 410 415 

Ser Ala 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CTTGGATCCA TAGCCCTGCT GTGATGAATT ACAG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GATGGATCCT CACCTCAGGG CCAGGCGTCA CTG 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

10 (ii) MOLECULE TYPE: cDNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CACGAATCTT TGAGAACATT ATAATGACCT TTGTGCCTCT TCTTGCAAGG TGTTTTCTCA 60 

GCTGTTATCT CAAGACATGG ATATAAAAAA CTCACCATCT AGCCTTAATT CTCCTTCCTC 120 

20 

CTACAACTGC AGTCAATCCA TCTTACCCCT GGAGCACGGC TCCATATACA TACCTTCCTC 180 

CTATGTAGAC AGCCACCATG AATATCCAGC CATGACATTC TATAGCCCTG CTGTGATGAA 240 

, 2 5 TTACAGCATT CCCAGCAATG TCACTAACTT GGAAGGTGGG CCTGGTCGGC AGACCACAAG 300 

CCCAAATGTG TTGTGGCCAA CACCTGGGCA CCTTTCTCCT TTAGTGGTCC ATCGCCAGTT 360 

ATCACATCTG TATGCGGAAC CTCAAAAGAG TCCCTGGTGT GAAGCAAGAT CGCTAGAACA 420 

30 

CACCTTACCT GTAAACAGAG AGACACTGAA AAGGAAGGTT AGTGGGAACC GTTGCGCCAG 480 

CCCTGTTACT GGTCCAGGTT CAAAGAGGGA TGCTCACTTC TGCGCTGTCT GCAGCGATTA 54 0 

35 CGCATCGGGA TATCACTATG GAGTCTGGTC GTGTGAAGGA TGTAAGGCCT TTTTTA7WVG 600 

AAGCATTCAA GGACATAATG ATTATATTTG TCCAGCTACA AATCAGTGTA CAATCGATAA 660 

AAACCGGCGC AAGAGCTGCC AGGCCTGCCG ACTTCGGAAG TGTTACGAAG TGGGAATGGT 720 
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35 



GAAGTGTGGC TCCCGGAGAG AGAGATGTGG GTACCGCCTT GTGCGGAGAC AGAGAAGTGC 780 

CGACGAGCAG CTGCACTGTG CCGGCAAGGC CAAGAGAAGT GGCGGCCACG CGCCCCGAGT 84 0 

GCGGGAGCTG CTGCTGGACG CCCTGAGCCC CGAGCAGCTA GTGCTCACCC TCCTGGAGGC 900 

TGAGCCGCCC CATGTGCTGA TCAGCCGCCC CAGTGCGCCC TTCACCGAGG CCTCCATGAT 960 

GATGTCCCTG ACCAAGTTGG CCGACAAGGA GTTGGTACAC ATGATCAGCT GGGCCAAGAA 102 0 

GATTCCCGGC TTTGTGGAGC TCAGCCTGTT CGACCAAGTG CGGCTCTTGG AGAGCTGTTG 1080 

GATGGAGGTG TTAATGATGG GGCTGATGTG GCGCTCAATT GACCACCCCG GCAAGCTCAT 1140 

CTTTGCTCCA GATCTTGTTC TGGACAGGGA TGAGGGGAAA TGCGTAGAAG GAATTCTGGA 1200 

AATCTTTGAC ATGCTCCTGG CAACTACTTC AAGGTTTCGA GAGTTAAAAC TCCAACACAA 1260 

AGAATATCTC TGTGTCAAGG CCATGATCCT GCTCAATTCC AGTATGTACC CTCTGGTCAC 1320 

AGCGACCCAG GATGCTGACA GCAGCCGGAA GCTGGCTCAC TTGCTGAACG CCGTGACCGA 1380 

TGCTTTGGTT TGGGTGATTG CCAAGAGCGG CATCTCCTCC CAGCAGCAAT CCATGCGCCT 1440 

GGCTAACCTC CTGATGCTCC TGTCCCACGT CAGGCATGCG AGTAACAAGG GCATGGAACA 1500 

TCTGCTCAAC ATGAAGTGCA AAAATGTGGT CCCAGTGTAT GACCTGCTGC TGGAGATGCT 1560 

GAATGCCCAC GTGCTTCGCG GGTGCAAGTC CTCCATCACG GGGTCCGAGT GCAGCCCGGC 162 0 

AGAGGACAGT AAAAGCAAAG AGGGCTCCCA GAACCCACAG TCTCAGTGAC GCCTGGCCCT 168 0 

GAGGTGAACT GGCCCACAGA GGTCACAAGC TGAAGCGTGA ACTCCAGTGT GTCAGGAGCC ^\1A0 

TGGGCTTCAT CTTTCTGCTG TGTGGTCCCT CATTTGGTGA TGGCAGGCTT GGTCATGTAC 1800 

CATCCTTCCC TCCACCTTCC CAACTCTCAG GAGTCGGTGT GAGGAAGCCA TAGTTTCCCT 18 60 



TGTTAGCAGA GGGACATTTG AATCGAGCGT TTCCACAC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



Met Asp lie Lys 
1 

Asn Cys Ser Gin 
20 

Pro Ser Ser Tyr 
35 

Tyr Ser Pro Ala 

50 

Leu Glu Gly Gly 
65 

Pro Thr Pro Gly 



His Leu Tyr Ala 
100 



Asn Ser Pro Ser 
5 

Ser lie Leu Pro 



Val Asp Ser His 
40 

Val Met Asn Tyr 

55 

Pro Gly Arg Gin 
70 

His Leu Ser Pro 
85 

Glu Pro Gin Lys 



Ser Leu Asn Ser 
10 

Leu Glu His Gly 

25 

His Glu Tyr Pro 



Ser lie Pro Ser 
60 

Thr Thr Ser Pro 

75 

Leu Val Val His 
90 

Ser Pro Trp Cys 
105 



Pro Ser Ser Tyr 
15 

Ser He Tyr He 
30 

Ala Met Thr Phe 
45 

Asn Val Thr Asn 



Asn Val Leu Trp 

80 

Arg Gin Leu Ser 
95 

Glu Ala Arg Ser 
110 



Leu Glu His Thr Leu Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val 



115 



120 



125 



Ser Gly Asn Arg Cys Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg 
130 135 140 

Asp Ala His Phe Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His 
145 150 155 160 

Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser 
165 170 175 

lie Gin Gly His Asn Asp Tyr lie Cys Pro Ala Thr Asn Gin Cys Thr 
180 185 190 

lie Asp Lys Asn Arg Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys 
195 200 205 

Cys Tyr Glu Val Gly Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys 
210 215 220 

Gly Tyr Arg Leu Val Arg Arg Gin Arg Ser Ala Asp Glu Gin Leu His 
225 230 235 240 

Cys Ala Gly Lys Ala Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg 

245 250 255 

Glu Leu Leu Leu Asp Ala Leu Ser Pro Glu Gin Leu Val Leu Thr Leu 
260 265 270 

Leu Glu Ala Glu Pro Pro His Val Leu lie Ser Arg Pro Ser Ala Pro 
275 280 285 

Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys 

290 295 300 

Glu Leu Val His Met lie Ser Trp Ala Lys Lys lie Pro Gly Phe Val 
305 310 315 320 



Glu Leu Ser Leu Phe Asp Gin Val Arg Leu Leu Glu Ser Cys Trp Met 
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325 330 335 

Glu Val Leu Met Met Gly Leu Met Trp Arg Ser lie Asp His Pro Gly 
340 345 350 

5 

Lys Leu lie Phe Ala Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys 
355 360 365 

Cys Val Glu Gly lie Leu Glu lie Phe Asp Met Leu Leu Ala Thr Thr 
10 370 375 380 

Ser Arg Phe Arg Glu Leu Lys Leu Gin His Lys Glu Tyr Leu Cys Val 
385 390 395 400 

15 Lys Ala Met lie Leu Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala 

405 410 415 

Thr Gin Asp Ala Asp Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala 
420 425 430 

20 

Val Thr Asp Ala Leu Val Trp Val lie Ala Lys Ser Gly lie Ser Ser 
435 440 445 

Gin Gin Gin Ser Met Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His 
25 450 455 460 

Val Arg His Ala Ser Asn Lys Gly Met Glu His Leu Leu Asn Met Lys 
465 470 475 480 

30 Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu Leu Glu Met Leu Asn 

485 490 495 

Ala His Val Leu Arg Gly Cys Lys Ser Ser lie Thr Gly Ser Glu Cys 

500 505 510 



35 



Ser Pro Ala Glu Asp Ser Lys Ser Lys Glu Gly Ser Gin Asn Pro Gin 
515 520 525 



Ser Gin 



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GTGCGGATCC TCTCAAGACA TGGATATAAA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
AGTAACAGGG CTGGCGCAAC GGTTC 
(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 



5 
10 
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(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ACTGGCGATG GACCACTAAA GG 22 



15 



